1  INTRODUCTION


The  U.S.  global  change  research  program  embraces  three  important 
classes  of  activities.  One  is  the  long-term  monitoring  of  climatic  variables 
to  detect  and  measure  global  change.  A  second  class  comprises  detailed 
short-term  studies  of  the  processes  that  govern  the  climate  system.  The 
third  activity  involves  the  construction  of  climate  models  (e.g.,  ocean  and 
atmospheric  general  circulation  models  -  GCMs)  incorporating  the  results  of 
process  studies  to  improve  the  ability  to  predict  and  eventually  to  respond  to 
global  change.  The  CHAMMP  program  clearly  falls  within  the  third  class  of 
activities  (modeling)  and  complements  DOE’s  Atmospheric  Radiation
Measurement  Program  (ARM)  of  process  studies  in  the  atmosphere,  and  NASA’s
Earth  Observing  System  (EOS)  program,  aimed  primarily  at  monitoring. 

CHAMMP’s  stated  goal  is  “To  develop,  verify,  and  apply  a  new
generation  of  climate  model  within  a  coordinated  framework  that:

1.  incorporates  the  best  available  scientific  and  numerical  approaches  to 
represent  physical,  biogeochemical,  and  ecological  processes; 

2.  fully  utilizes  the  hardware  and  software  capabilities  of  new  computer 
architectures; 

3.  probes  the  limits  of  climate  predictability; 

4.  can  be  used  to  address  the  challenging  problem  of  understanding  the
greenhouse  climate  issue  through  the  ability  of  models  to  simulate
time-dependent  climatic  changes  over  extended  times  with  regional
resolution.”

The  JASON  review  of  the  CHAMMP  program  took  place  during  early 
July  1991.  This  review  continues  the  role  that  JASON  has  played  in  advising
program  management  and  the  CHAMMP  participants  on  the  scientific  and 
programmatic  issues.  Our  report  from  last  year  can  be  found  in  JSR-90- 
306.  Input  to  the  present  review  came  in  the  form  of  briefings  on  various 
scientific  aspects  of  CHAMMP  by  scientists  involved  in  the  program  and  by 
other  members  of  the  climate  modeling  community  on  July  13,  1991.  We 
also  heard  a  full  day  of  presentations  on  the  accomplishments,  status,  and 
prospects  for  the  program  given  by  the  CHAMMP  management  team  on  July 
12.  Additional  written  material  was  provided  to  us  by  the  CHAMMP
management.  Several  JASONs  also  have  familiarity  with  the  program  through
informal  contacts  with  CHAMMP  management  and  scientists  over  the  past 
year. 


Following  a  brief  discussion  of  program  accomplishments  (Section  2),  we 
examine  issues  raised  in  highly  parallel  computing  (Section  3).  We  next  take 
up  the  questions  associated  with  predictability  (Section  4),  and  present  our 
assessment  of  the  program  (Section  5).  Possible  future  directions  of  research 
are  discussed  in  Section  6.  The  links  between  CHAMMP  and  other  elements 
of  the  Department  of  Energy’s  global  change  programs  are  considered  in 
Section  7.  The  report  also  contains  four  technical  appendices,  which  are 
included  to  illustrate  certain  programmatic  issues  discussed  in  the  main  body 
of  the  report.  Appendix  A  presents  one  attempt  to  narrow  the  definition
of  what  is  meant  by  predictability  of  climate.  The  questions  of  coherent
structures  and  spatial  statistics  are  discussed  in  Appendix  B.  A  fluctuation- 
correlation  relationship  is  presented  in  Appendix  C.  Appendix  D  illustrates 
the  potential  contribution  from  mathematics  by  providing  a  mathematical 
argument  on  the  nature  of  extreme  events  in  dynamical  systems  that  can  be 
represented  by  an  attractor.


2  PROGRAM  ACCOMPLISHMENTS


The  first  step  in  the  CHAMMP  program  strategy  is  to  adapt  the  current 
generation  of  GCMs  to  high-performance  parallel  computers.  At  the  time  of 
our  first  review  of  CHAMMP,  the  program  was  just  beginning.  During  the
past  year,  there  have  been  a  number  of  solid  scientific  accomplishments  that 
bode  well  for  the  future  of  the  program. 

A  major  accomplishment  is  the  implementation  of  the  NCAR  Community
Climate  Model  (CCM-2)  on  the  Intel  family  of  parallel  supercomputers.
This  is  a  full  spectral  atmospheric  GCM  [Legendre  transform  in  the  latitude,
Fourier  transform  in  the  longitude,  and  finite  difference  in  the  vertical
coordinate]  and  so  is  not  naturally  adapted  to  a  multiprocessor  architecture.
Nevertheless,  it  is  running  on  both  a  128-node  Gamma  and  a  540-node  Delta
computer,  and  validation  is  underway.  It  is  estimated  that  a  1-year
simulation  at  a  resolution  of  T42  will  take  2  hours  on  the  Delta,  while  the  same
calculation  at  T213  will  take  50  hours.

A  second  major  accomplishment  is  the  implementation  of  the  Semtner- 
Chervin  Ocean  GCM  on  the  single  instruction/multiple  data  (SIMD)  CM-2 
computer.  This  is  a  grid-point  model  with  an  eddy  resolving  (0.5°)  horizontal 
resolution,  20  vertical  levels,  and  a  realistic  bottom  topography.  Performance 
comparable  to  that  of  an  8-processor  Cray  YMP  has  been  achieved,  due  in
part  to  a  new  method  for  solving  the  surface  pressure  equation.  Further
improvements  are  expected.  This  same  model  has  also  been  implemented  on
the  multiple  instruction/multiple  data  (MIMD)  BBN  parallel  architecture,
and  there  has  been  an  extensive  study  of  the  efficiency  of  grid-point  methods 
for  a  shallow  water  problem  on  that  architecture. 

We  also  note  that,  within  the  last  year,  the  CHAMMP  program  has 
awarded  the  equivalent  of  13,000  Cray-1  hours,  has  solicited,  reviewed,  and 
awarded  a  number  of  Science  Team  proposals,  and  has  completed  Program, 
Management,  and  Implementation  plans.


3  ISSUES  IN  HIGHLY  PARALLEL  COMPUTATIONS


There  has  been  much  discussion  of  a  variety  of  highly  parallel  computing
hardware  architectures  in  recent  years.  The  biggest  difference  has  been
between  the  SIMD  architecture  of,  for  example,  the  Connection  Machine  2
(CM-2)  and  the  MIMD  architectures  of  machines  produced,  for  example,
by  Intel  and  BBN.  SIMD  is  clearly  more  restrictive  than  MIMD,  but  it  is
particularly  easy  to  use  for  regular  mesh  calculations,  especially  through  the
data  parallel  approach  embodied  in  the  new  FORTRAN-90  compiler.  The
greater  flexibility  of  MIMD  provides  more  freedom  of  choice  in  programming
models,  but  at  the  price  of  complexity.  It  has  long  been  realized  that  a  SIMD
machine  can  be  easily  simulated  on  a  MIMD  machine,  but  not  the  reverse.

The  additional  hardware  cost  of  MIMD  over  SIMD  capability  is
negligible,  and  thus  it  is  not  surprising  that,  in  the  next  few  years,  the  evolving
hardware  will  be  MIMD,  though  with  suitable  compilers  it  will  provide  the  ease  of
SIMD  data  parallel  use.  The  issue  then  will  be  to  decide  the  most
appropriate  programming  model  for  a  given  application.  The  two  main  choices
in  the  MIMD  world  are  to  treat  all  memory  as  though  it  were  shared  by
each  processor  node,  or  to  approximate  more  closely  the  actual  hardware
by  treating  the  memory  as  distributed  among  the  individual  nodes,  with
explicit  message  passing  to  transfer  data  between  the  local  memories  of  the
nodes.

Atmosphere  and  ocean  models  based  on  grid-point  methods  seem
particularly  well  suited  to  a  two-dimensional  domain  decomposition
message-passing  programming  model  on  MIMD  machines.  For  this  model,  the  whole
code  is  subdivided  horizontally  into  regions,  each  of  which  is  assigned  to  a 
single  node.  For  commonly  used  explicit  differencing  schemes,  it  is  then  only 
necessary  to  exchange  border  data  by  passing  messages  between  contiguous 
regions.  The  relative  cost  of  data  transfer  is  diminished  in  this  way  by  a 
perimeter-to-area  ratio.  This  cost  must  be  minimized  for  efficient  use  of 
parallel  hardware. 
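
The perimeter-to-area argument can be made concrete with a short calculation. The following is a minimal sketch under stated assumptions, not code from any CHAMMP model: the function name `halo_to_interior_ratio`, the one-point border width, and the sample grid and node counts are all hypothetical choices made for illustration. It compares the border points a node must exchange each step with the interior points it updates locally.

```python
def halo_to_interior_ratio(nx, ny, px, py, halo=1):
    """Border points exchanged per interior point for one subdomain.

    nx, ny : global grid dimensions (hypothetical values in the example)
    px, py : number of nodes along each horizontal direction
    halo   : border width needed by the differencing stencil (assumed 1)

    Assumes the node grid divides the global grid evenly and that the
    subdomain has neighbours on all four sides.
    """
    if nx % px or ny % py:
        raise ValueError("node grid must divide the global grid evenly")
    sub_x, sub_y = nx // px, ny // py
    border = 2 * halo * (sub_x + sub_y)   # points sent to the four neighbours
    interior = sub_x * sub_y              # points updated locally
    return border / interior


if __name__ == "__main__":
    # Same 512 x 512 grid on 16 nodes and on 256 nodes.
    print(halo_to_interior_ratio(512, 512, 4, 4))     # 0.03125
    print(halo_to_interior_ratio(512, 512, 16, 16))   # 0.125
```

In this example, using sixteen times as many nodes on the same grid quadruples the communication per unit of computation, which is why the subdivision and the machine's communication speed have to be considered together.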

There  are  two  advantages  to  such  a  distributed  memory  message-passing 
programming  model.  First,  communications  hardware  is  inherently  used 
more  efficiently  in  packet  mode  than  in  the  transmission  of  single  words. 
Second,  programming  is  far  simpler,  since  the  only  parallel  construct  needed 
is  to  send  or  receive  a  message;  this  is  conceptually  similar  to  the  traditional 
problem  of  writing  to  or  reading  from  a  working  storage  medium  in  programs 
whose  data  requirements  exceed  the  core  storage  capacity.  Care  must  be 
taken  to  choose  a  division  that  balances  the  computational  load  among  the 
processor  nodes. 

This  message-passing  model  is  not,  however,  without  its  drawbacks.  If
the  message  passing  is  just  a  programming  construct  for  the  code,  it  does  not
matter  (as  long  as  a  good  compiler  exists).  But  if  actual  message  passing  is
done  at  run-time,  there  are  penalties  associated  with  the  additional  expensive
communications  hardware,  data  copying,  and  message  header  information.
Note  also  that  shared  memory  does  not  require  a  static  partitioning  of
memory.


A  principal  purpose  of  the  directed  part  of  the  CHAMMP  program  is 
to  move  various  climate  models  onto  various  parallel  architectures  in  order 
to  acquire  the  experience  needed  to  select  the  most  appropriate  match  of
architecture  and  programming  model.  In  particular,  this  activity  will  evaluate
parallel  implementations  of  the  spectral  transform  models  commonly  used 
for  the  atmosphere.  The  Fast  Fourier  and  Gaussian  quadrature  transforms 
in  such  models  are  global  in  nature  and  pose  a  more  difficult  communication 
problem  for  parallel  message-passing  hardware  than  does  the  kind  of  local 
explicit  grid-point  method  described  above.  In  recent  years,  other  problems 
with  spectral  transform  models  have  arisen,  indicating  that  it  may  very  well 
be  time  to  abandon  spectral  models.


4  PREDICTABILITY


The  literature  contains  a  number  of  versions  of  what  is  meant  by
predictability  in  climate.  Different  authors  imply  different  meanings.  In  the
following,  we  discuss  one  particular  approach  to  predictability  (see  Appendix 
A  for  a  more  technical  discussion  of  predictability). 

As  is  well  known,  the  atmosphere  is  a  chaotic,  nonlinear  dynamical
system  with  a  limit  of  deterministic  weather  prediction  of  about  ten  days  to
two  weeks.  Any  atmospheric  time  series  of  interest  for  climate  prediction
must  therefore  be  contaminated  by  an  unpredictable  weather  noise
component.  Similarly,  any  atmospheric  GCM  run  for  climate  purposes  is  in  fact  a
weather  model  for  which  averages  are  extracted,  and  it,  too,  is  contaminated 
by  weather  noise.  Models  that  deal  directly  with  climate  statistics  are  free 
of  weather  noise,  but  suffer  from  a  variety  of  untested  closure  assumptions. 

The  problem  of  weather  noise  in  a  GCM  is  a  problem  in  statistical 
sampling  theory,  and  most  climate  sensitivity  studies  explicitly  estimate  the 
statistical  significance  of  the  result.  This  is  the  simplest  aspect  of  climate 
predictability,  and  involves  the  extraction  of  a  statistically  significant  signal 
from  a  weather  noise  background  that  can  be  reduced  only  as  the  square  root 
of  the  length  of  the  time  series  or,  for  models,  the  amount  of  computation. 
The  implicit  assumption  here  is  that  the  weather  noise  is  “white”  for
frequencies  much  below  a  few  cycles  per  month.  This  white  noise  is  associated
with  the  fast  physics  and  dynamics  of  the  atmosphere  alone. 
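
The square-root behavior can be checked numerically. The sketch below is illustrative only and assumes the idealized case described in the text: the "weather noise" is modeled as independent Gaussian samples (white noise), and the names `spread_of_sample_means`, `n_samples`, and `n_trials` are hypothetical. It measures how much the mean of a record of given length scatters from one realization to the next.

```python
import random
import statistics


def spread_of_sample_means(n_samples, n_trials=2000, sigma=1.0, seed=0):
    """Standard deviation, across trials, of the mean of a white-noise record.

    Each trial draws n_samples independent Gaussian values with standard
    deviation sigma and averages them. The scatter of those averages is the
    noise level left after averaging a record of that length.
    """
    rng = random.Random(seed)
    means = [
        statistics.fmean(rng.gauss(0.0, sigma) for _ in range(n_samples))
        for _ in range(n_trials)
    ]
    return statistics.pstdev(means)


if __name__ == "__main__":
    short = spread_of_sample_means(100)
    long_ = spread_of_sample_means(400)
    # Quadrupling the record length should roughly halve the residual noise.
    print(short, long_, short / long_)
```

Under these assumptions, a fourfold increase in record length (or in model integration time) buys only a factor of about two in noise reduction. Noise that is correlated in time would reduce more slowly still.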




Of  course,  the  climate  system  includes  slow  components  as  well,  and 
one  of  the  first  questions  asked  of  atmospheric  GCMs  was  whether  they 
responded  properly  to  slowly  changing  external  forcing  influences,  such  as 
sea  surface  temperature.  This  is  not  a  question  of  theoretical  predictability 
with  a  perfect  model,  as  discussed  above,  but  rather  a  question  of  practical 
predictability,  considering  the  imperfect  nature  of  the  models.  Clearly,  it  is 
hard  to  estimate  the  importance  of  model  imperfections,  but  an  effort  is  being 
made  through  the  DOE-supported  Program  for  Climate  Model  Diagnosis  and 
Intercomparison  (PCMDI).  Even  if,  in  response  to  the  PCMDI  analysis,  all 
models  are  brought  into  agreement,  there  is  no  guarantee  that  they  do  not 
all  err  in  omitting  unknown,  but  physically  important,  processes. 

The  fast  response  of  the  atmosphere  to  changing  influences  might  be 
estimated  from  the  observed  behavior  of  the  atmosphere  itself,  without
recourse  to  models,  by  use  of  the  fluctuation-dissipation  relation  of  statistical
physics  (see  Appendix  C).  The  essential  idea  here  is  that  a  complex  system 
will  recover  from  an  artificially  imposed  impulsive  disturbance  in  the  same 
way  that  it  recovers  from  a  naturally  occurring  fluctuation.  In  this  way,  the 
mean  impulse  response  matrix  is  estimated  as  being  equal  to  the  observed 
statistical  regression  matrix.  The  atmosphere  is  not  in  statistical  mechanical 
equilibrium  so  that  a  fluctuation-dissipation  relation  need  not  hold  exactly, 
but  agreement  between  the  two  estimates  of  the  response  would  considerably 
improve  confidence  in  model  results. 
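
The idea can be illustrated on a system simple enough that both estimates are known. The sketch below is a toy under loud assumptions: a scalar first-order autoregressive process stands in for the climate system (so the "matrices" reduce to single numbers), and the names `simulate_ar1`, `lagged_regression`, and `impulse_response` are hypothetical. It is not the formulation of Appendix C. For this linear system the response to an imposed kick after k steps is a**k, and the regression of the state on its own value k steps earlier should approach the same number.

```python
import random


def simulate_ar1(a, n, seed=0, x0=0.0):
    """Return n steps of x[t+1] = a * x[t] + unit Gaussian noise."""
    rng = random.Random(seed)
    x, out = x0, []
    for _ in range(n):
        x = a * x + rng.gauss(0.0, 1.0)
        out.append(x)
    return out


def lagged_regression(series, lag):
    """Regression coefficient of x[t + lag] on x[t], from natural fluctuations."""
    mean = sum(series) / len(series)
    dev = [v - mean for v in series]
    n = len(dev) - lag
    cov = sum(dev[i] * dev[i + lag] for i in range(n))
    var = sum(dev[i] * dev[i] for i in range(n))
    return cov / var


def impulse_response(a, lag, kick=1.0, seed=1):
    """Response after lag >= 1 steps to an imposed kick, using twin runs
    that share the same noise and differ only in the initial state."""
    base = simulate_ar1(a, lag, seed=seed, x0=0.0)
    kicked = simulate_ar1(a, lag, seed=seed, x0=kick)
    return (kicked[-1] - base[-1]) / kick


if __name__ == "__main__":
    a, lag = 0.8, 3
    record = simulate_ar1(a, 200_000)
    print(lagged_regression(record, lag), impulse_response(a, lag), a ** lag)
```

The agreement here is guaranteed by linearity. For the real atmosphere it is an empirical question, which is what makes the comparison with model responses informative.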

Spatially  coherent  structures  are  responsible  for  observed  “persistence” 
in  weather  systems  (see  Appendix  B).  As  far  as  climate  is  concerned,  those
spatial  structures  that  maintain  coherence  over  periods  of  years  to  centuries
are  particularly  important.  In  contrast  to  the  days  or  weeks  characteristic  of 
spatial  structures  in  the  atmosphere,  the  coherence  times  for  structures  in  the 
ocean  are  much  longer.  The  most  clearly  recognized  slow  climate  fluctuation 
in  the  coupled  ocean-atmosphere  system  is  the  El  Niño/Southern  Oscillation
(ENSO),  and  it  would  be  valuable  to  estimate  limits  on  its  predictability. 
ENSO  variability  has  already  been  seen  to  obscure  other  possible  slow  global 
changes. 

A  natural  question  then  arises  about  the  theoretical  limit  on  the
deterministic  predictability  of  the  ocean,  taking  into  account  the  uncertainty  in
its  present  state  and  its  time-chaotic  behavior.  Does  the  ocean  generate
irreducible  “weather”  noise  that  will  obscure  other  low-frequency  signals?  The
development  of  global  ocean  models  has  just  reached  the  point  where  the 
models  can  be  used  to  estimate  this  theoretical  predictability. 

Another  rather  disturbing  possibility  is  that,  as  with  so  many  complex 
systems  with  many  effective  degrees  of  freedom,  the  climate  is  in  a  state 
of  self-organized  criticality.  In  this  case,  not  only  is  the  climate  variance
possibly  divergent  at  low  frequencies,  as  with  1/f  noise,  but,  even  worse,  its  variability
could  be  made  up  of  a  series  of  unpredictable  catastrophes  of  all  scales  of
magnitude.  A  recently  studied  prototype  of  such  a  system  arises  as  grains  of
sand  are  randomly  dropped  on  a  sand  pile.  It  is  clearly  important  to  look  for 
grains  of  sand  in  the  climate  system;  they  might  cause  the  present  climate 
research  program  to  grind  to  a  halt. 
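
For readers unfamiliar with the sand-pile prototype, a minimal sketch follows. It assumes the commonly described rule set for such models (a square grid, a cell topples when it holds four or more grains, one grain goes to each of its four neighbors, and grains falling off the edge are lost). These rules, the name `run_sandpile`, and the grid size are illustrative assumptions, not a description of any climate process.

```python
import random


def run_sandpile(size=20, n_grains=5000, seed=0):
    """Drop grains one at a time and record the avalanche each one triggers.

    Returns the final grid, the list of avalanche sizes (number of topplings
    per dropped grain), and the number of grains lost over the edges.
    """
    rng = random.Random(seed)
    grid = [[0] * size for _ in range(size)]
    avalanche_sizes = []
    lost = 0
    for _ in range(n_grains):
        i, j = rng.randrange(size), rng.randrange(size)
        grid[i][j] += 1
        topples = 0
        unstable = [(i, j)] if grid[i][j] >= 4 else []
        while unstable:
            r, c = unstable.pop()
            if grid[r][c] < 4:          # already relaxed via an earlier entry
                continue
            grid[r][c] -= 4
            topples += 1
            if grid[r][c] >= 4:         # may still be unstable after one toppling
                unstable.append((r, c))
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                rr, cc = r + dr, c + dc
                if 0 <= rr < size and 0 <= cc < size:
                    grid[rr][cc] += 1
                    if grid[rr][cc] >= 4:
                        unstable.append((rr, cc))
                else:
                    lost += 1           # grain falls off the open boundary
        avalanche_sizes.append(topples)
    return grid, avalanche_sizes, lost


if __name__ == "__main__":
    _, sizes, _ = run_sandpile()
    print("largest avalanche:", max(sizes))
    print("grains causing no toppling:", sizes.count(0))
```

Most grains cause nothing, while an occasional grain sets off an avalanche spanning much of the grid. There is no characteristic event size to average over, which is the property that would make such a system hard to predict.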


5  PROGRAM  ASSESSMENT


There  is  clearly  great  scientific  merit  in  the  CHAMMP  program,  and  it 
is  likely  that  this  activity  will  produce  many  spinoffs  en  route  to  its  stated 
goal.  Among  these  will  be  the  development  of  a  very  important  infrastructure 
for  applying  massively  parallel  computers  to  scientific  problems.  One  can 
expect  that  libraries  of  routines,  visualization  techniques,  database  methods, 
and  general  parallel  computing  expertise  that  will  be  developed  during  the 
course  of  the  program  will  be  of  importance  to  the  entire  scientific  community. 
As  noted  above,  it  is  also  not  clear  what  architectures  are  best  suited  to 
which  problems;  CHAMMP  is  directly  exploring  this  issue,  at  least  for  the 
problems  relevant  to  climate  (fluid  dynamics,  radiation  transport,  database 
construction,  and  archiving).  The  expertise  of  the  DOE  labs  is  particularly 
well  suited  to  these  tasks,  and  one  might  expect  that  they  will  repeat  their 
performance  of  the  1960s  and  1970s,  when  they  defined  and  furthered  the 
use  of  serial  supercomputers. 

Another  important  spinoff  from  CHAMMP  will  likely  be  improved
capabilities  for  numerical  weather  prediction.  The  codes  ported  and/or  developed
by  CHAMMP  are  very  similar  to  (and,  in  some  cases,  identical  with)  those
used  for  operational  weather  prediction,  and  the  use  of  massively  parallel
architectures  for  that  activity  will  clearly  improve  present  capabilities  to  handle
both  synoptic  and  seasonal  scales. 

We  are  less  optimistic  about  CHAMMP’s  achieving  its  scientific  goal  of
an  improvement  in  climate  modeling.  There  are  a  number  of  reasons  for  this
skepticism. 

First,  the  spatial  and  temporal  resolution  of  present  or  foreseeable  GCMs 
requires  numerous  parameterizations  of  sub-grid-scale  processes.  Some  of 
these  are  of  clear  importance  (e.g.,  clouds  and  their  effect  on  the  radiation 
balance),  and  their  improvement  is  the  focus  of  intense  activity,  such  as  that 
of  ARM.  However,  it  is  not  clear  which  other  parameterizations  are  crucial 
and  which  are  unimportant.  There  are  surely  as  yet  unimagined  processes 
that  will  have  to  be  incorporated.  A  reliable  climate  model  must  include 
all  of  these,  and  include  them  correctly.  This  is  a  daunting  task,  far  more 
difficult  than  simply  getting  the  models  to  run  fast  on  the  latest  computer 
architecture. 

Second,  the  present  direction  of  CHAMMP  research  (and,  indeed,  of
most  climate  modeling)  is  a  “brute  force”  approach  in  which  very  high
spatial  resolution  with  very  short  time  steps  (e.g.,  15  minutes)  is  used  to  answer
questions  about  behavior  on  regional  scales  and  decadal  time  scales.  This
brute  force  approach  may  well  not  be  the  best  way  of  understanding  the
climate  system.  A  useful  analogy  might  be  the  task  of  describing  the  behavior
of  a  gas.  The  “brute  force”  approach  would  involve  positing  some  potential  between
the  molecules  of  the  gas  and  integrating  the  large  number  of  molecular
dynamic  equations  with  femtosecond  time  steps.  However,  if  the  long-term
(say,  nanosecond)  spatially  averaged  (micron  or  larger)  behavior  is  of
interest,  then  much  simpler  and  more  transparent  thermodynamic  methods  can
be  applied  (see  Section  6).

Third,  the  two  most  important  components  of  the  climate  system  (the
atmosphere  and  the  oceans)  have  very  different  dynamical  time  scales  (a  few 
days  versus  decades).  True  climate  dynamics  are  therefore  likely  determined 
almost  exclusively  by  the  ocean,  with  the  atmosphere  serving  to  provide  a 
surface  forcing  varying  stochastically  about  a  mean.  Thus  a  new  paradigm 
beyond  simply  coupled  atmospheric  GCMs  and  ocean  GCMs  is  needed  to 
accommodate  the  fast  physics  efficiently.


6  POSSIBLE  FUTURE  DIRECTIONS  OF  RESEARCH


In  this  section,  we  note  areas  where  CHAMMP  could  benefit  by  bringing
in  scientists  who  have  not  been  part  of  the  traditional  climate  modeling
community.  We  believe  that  the  injection  of  new  ideas  and  concepts  is  essential
if  CHAMMP  is  to  near  its  stated  goals.

As  we  have  remarked  earlier,  most  GCMs  (and  many  GCM
practitioners)  have  grown  out  of  traditional  numerical  weather  forecasting.  This  has
led  to  attempts  to  understand  climate  by  using  computer  calculations  that
repeat  a  very  large  number  of  short  time  steps  of  the  basic  equations  of  fluid
mechanics  and  radiation  transport.  There  has  been  very  little  work
attempting  to  exploit  totally  different  approaches.  One  could  imagine,  for  example,
some  sort  of  analogue  of  thermodynamics  in  climate,  in  which  the  “brute 
force”  GCM  approach  is  analogous  to  solving  Newton’s  equations  for  each 
molecule  in  a  gas. 

No  real  mechanisms  are  currently  available  to  encourage  and  support
research  directed  toward  totally  new  and  different  ways  of  studying  and
predicting  climate.  We  suggest  that  there  is  a  need  for  DOE  (and/or  perhaps
NSF)  to  encourage  efforts  to  break  out  of  the  canonical  GCM  mode  by
supporting  a  series  of  summer  schools  or  workshops  on  climate.  These  workshops
should  be  focused  as  broadly  as  possible  on  means  other  than  GCMs  to
predict  the  future.  Two  natural  hosts  for  such  a  series  are  the  Aspen  Center
for  Physics  and  the  Institute  for  Theoretical  Physics  in  Santa  Barbara.  One
should  invite  a  small  number  of  climate  phenomenologists  to  give  the
physical  basis  of  the  problem,  but  avoid  populating  these  schools  with  many  GCM
practitioners.  The  rest  of  the  attendees  would  be  people  largely  ignorant  of 
climate  research,  but  drawn  from  fields  that  might  provide  a  basis  for  useful 
new  approaches.  These  might  include  physicists  interested  in  many-body 
physics,  statistical  physicists,  nonlinear  dynamicists,  and  mathematicians 
with  relevant  interests  (see  Appendix  D). 

The  ultimate  goal  of  CHAMMP,  to  build  a  new  generation  of  climate
simulation  codes,  requires  the  best  technology  and  computer  hardware,
the  use  of  advanced  applied  mathematics  and  numerical  methods,  and
improvement  in  model  physics.  The  present  program  has  little  emphasis  on
applied  mathematics  and  numerical  methods.  The  directed  model
development  program  is  almost  exclusively  devoted  to  porting  existing  climate  codes
to  massively  parallel  architectures.  In  the  non-DOE  science  program,  there
are  only  two  proposals  out  of  eight  (UCLA/CSU  and  UCSD)  that  appear
to  develop  genuinely  new  algorithms.  CHAMMP  has  not  involved  strong
research  programs  in  modeling  fluid  dynamics,  even  those  at  DOE
laboratories.  There  is  no  lack  of  interest  on  the  part  of  these  groups  in  climate
modeling.  For  example,  groups  at  both  Los  Alamos  and  Livermore  are  interested
in  initiating  research  programs  in  numerical  algorithm  development
specifically  applicable  to  ocean  and  atmospheric  circulation  problems.  CHAMMP
should  attempt  to  involve  applied  mathematicians  and  numerical  analysts  in
its  overall  program.


7  LINKS  BETWEEN  ARM,  CHAMMP,  AND  THE  CORE  PROGRAM


The  Office  of  Energy  Research  contributes  to  the  national  global  change
program  through  the  activities  of  three  groups:  ARM,  concerned  with
radiation  and  cloud  measurements  in  support  of  modeling  activities;  CHAMMP,
centered  on  improving  models  through  improved  software,  hardware,  and
algorithms;  and  the  Core  Program,  which  includes  an  intercomparison  activity
among  models,  as  well  as  support  for  modeling  activities  at  a  number  of
universities.  Both  ARM  and  CHAMMP,  which  are  new  programs,  have  clearly
defined  management  teams  that  appear  to  be  working  effectively  as  far  as
the  individual  programs  are  concerned.  There  is  no  comparable  management
team  for  the  Core  Program,  although  W.  Lawrence  Gates  has  the  lead
responsibility  for  the  intercomparison  program.  Robert  Cess  at  Stony  Brook
serves  as  a  senior  outside  advisor  to  both  the  intercomparison  activities  and
some  aspects  of  the  Core  Program.

In  order  for  these  programs  to  be  cost  effective,  it  is  absolutely  essential 
that  there  be  a  close  coordination  among  them.  Both  ARM  and  CHAMMP 
have  science  teams.  One  means  of  achieving  coordination  is  for  the  two
science  teams  to  have  common  members.  Since  the  science  teams  are  composed
of  principal  investigators  (PIs),  whether  or  not  overlap  is  achieved  depends  on
the  selection  process,  which  has  a  stochastic  component.  It  might  therefore
be  advisable  to  have  a  few  members  of  one  science  team  designated  as
members  of  the  other  science  team,  or  perhaps  to  have  a  few  non-PI  scientists,
interested  in  both  ARM  and  CHAMMP,  serve  on  both  science  teams.

The  links  of  the  Core  Program  to  ARM  and  CHAMMP  are  less  clear. 
Because  of  the  lack  of  a  well-defined  management  structure,  there  does  not 
appear  to  be  a  natural  way  by  which  the  Core  Program  can  interact  with 
ARM  and  CHAMMP.  The  linkage  is  currently  best  through  the  model
intercomparison  program,  since  it  resides  at  Lawrence  Livermore  National
Laboratory  (LLNL),  and  Michael  MacCracken,  also  of  LLNL,  is  chief  scientist
of  the  CHAMMP  program.  We  do  not  know  of  any  interaction  between  the 
Core  Program  and  ARM. 

In  view  of  the  importance  of  avoiding  duplicate  activities,  it  is  essential 
that  the  Core  Program  be  closely  allied  with  CHAMMP  and,  to  the  extent 
possible,  with  ARM.  This  would  require  designation  of  a  project  manager 
and  chief  scientist  for  the  Core  Program.  These  two  individuals  would  then 
have  the  responsibility  of  working  with  the  ARM  and  CHAMMP  teams  to 
ensure  close  cooperation  among  the  three  programs.


A  PREDICTABILITY  OF  CLIMATE


As  soon  as  one  asks  about  the  predictability  of  climate,  the  question  of 
the  meaning  of  “predictability”  arises.  At  one  level,  this  question  is  relatively 
easy  to  answer,  provided  that  we  have  some  notion  or  measure  of  similarity 
of  climates.  With  such  a  notion  at  hand,  we  can  take  a  climate  predictor 
(such  as  a  GCM),  test  it  experimentally  by  running  it  forward  a  fixed  number 
of  years  for  a  large  variety  of  realized  initial  conditions,  see  how  it  performs 
against  true  climate  using  the  selected  measure  of  similarity,  and  so  produce 
a  satisfactory  numerical  measure  of  its  reliability. 

While  the  above  concepts  are  straightforward,  they  cannot  at  present  be 
implemented.  We  do  not  have  a  satisfactory  notion  of  similarity  of  climates 
(see  Appendix  B)  and  no  such  comprehensive  notion  is  anywhere  in  sight. 
The  best  we  can  do  at  present  (as  is  indeed  commonly  done)  is  to  compare, 
at  a  very  rough  spatial  resolution,  the  temperature  or  other  fields  over  global 
or  some  large  regions.  Furthermore,  the  useful  database  of  results  is  so  small 
relative  to  the  number  of  variables  or  possible  scenarios  as  to  be  ludicrous. 

Of  course,  all  physical  theories  face  the  same  tests  that  we  have  just 
described  for  climate  predictors.  Physical  theories  are  just  predictors  of  the 
behavior  of  the  natural  world,  and  they  are  tested  under  a  wide  variety  of 
circumstances.  When  they  are  badly  stressed  in  certain  regimes,  they  often 
fail  and  have  to  be  replaced. 

Moreover,  many  physical  theories  are  guided  in  their  construction  by
concern  for  Ockham’s  razor:  the  assumptions  going  into  them  are  relatively
few  and  mostly  visible.  When  they  fail,  it  is  often  possible  to  see  which 
assumptions  were  at  fault.  (Invisible  or  unperceived  assumptions  cause  more 
difficulty.) 

Nothing  like  this  holds  for  climate  models.  The  number  of  assumptions,
tuned  parameters,  approximations,  etc.,  defies  description.  The  detailed
operation  of  the  model  is  never  seen;  only  the  computer  “knows”  what  is
going  on.  And  given  how  difficult  it  is  to  make  these  models  function
satisfactorily  in  normal  circumstances,  what  grounds  are  there  for  believing  their
performance  in  stressed  regimes  (such  as  doubled  CO2)?

Yet  what  can  we  do  about  predicting  climate?  One  can  look  for  some 
sharper  insights  than  those  models  give  us  —  while  they  are  an  audacious 
attack  on  the  problem,  they  are  still  mostly  brute  force  —  and  hope  for  more 
inspired  predictors  based  on  perceiving  general  principles.  But  until  these 
come  along  we  are  stuck  with  GCMs.  (We  do  not  wish  to  suggest,  however, 
that  since  GCMs  are  the  only  tool  available  just  now,  they  are  the  right  tool 
for  now.) 

In  traditional  statistical  theory,  when  a  predictor  for  a  random  variable 
is  given,  it  is  analyzed  by  finding  the  distribution  function  of  the  error,  or  at 
the  very  least,  the  variance  of  the  error.  Nothing  like  this  has  ever  been  done 
in  GCMs.  Even  though  they  are  not  totally  deterministic,  we  are  presented 
with  a  run  and  told,  “This  is  the  climate  prediction.”  It  is,  of  course,  very 
expensive  and  time-consuming  to  run  GCMs  forward  many  years,  but  so  is 
making  incorrect  policy  decisions  based  on  their  predictions.  At  the  very
least,  then,  models  should  be  run  with  a  large  variety  of  initial  conditions,
slightly  offset  tuning  parameters,  etc.,  in  order  to  see  the  change  (or  study 
the  variance)  in  their  predictions.  And  when  clouds  are  put  into  models  more 
realistically,  a  little  stochasticity  in  cloud  distribution  can  be  investigated  as 
well. We will have to improvise some measure of distance between climates
to quantify the variance.
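A minimal sketch of such an ensemble experiment follows. It uses the three-variable Lorenz (1963) system as a stand-in for a GCM, which is purely an illustrative assumption; the function names, the perturbation sizes, and the choice of the time-mean of z as the "climate" statistic are all hypothetical. Each ensemble member starts from a slightly offset initial condition and a slightly offset tuning parameter, and the spread of the resulting statistic is reported.

```python
import random

def lorenz_step(state, rho, dt=0.01, sigma=10.0, beta=8.0 / 3.0):
    """Advance the Lorenz-63 system by one fourth-order Runge-Kutta step."""
    def f(s):
        x, y, z = s
        return (sigma * (y - x), x * (rho - z) - y, x * y - beta * z)

    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = f(state)
    k2 = f(shifted(state, k1, dt / 2))
    k3 = f(shifted(state, k2, dt / 2))
    k4 = f(shifted(state, k3, dt))
    return tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def climate_statistic(initial, rho, n_spinup=1000, n_steps=4000):
    """Time-mean of z after a spin-up period: the toy 'climate' of one run."""
    state = initial
    for _ in range(n_spinup):
        state = lorenz_step(state, rho)
    total = 0.0
    for _ in range(n_steps):
        state = lorenz_step(state, rho)
        total += state[2]
    return total / n_steps

def ensemble(n_members=8, seed=0):
    """Run members with perturbed initial conditions and a perturbed parameter."""
    rng = random.Random(seed)
    members = []
    for _ in range(n_members):
        initial = (1.0 + rng.gauss(0, 1e-3),
                   1.0 + rng.gauss(0, 1e-3),
                   20.0 + rng.gauss(0, 1e-3))
        rho = 28.0 * (1.0 + rng.gauss(0, 0.01))  # ~1% offset of a tuning parameter
        members.append(climate_statistic(initial, rho))
    mean = sum(members) / len(members)
    variance = sum((m - mean) ** 2 for m in members) / (len(members) - 1)
    return members, mean, variance

members, mean, variance = ensemble()
print("ensemble mean of <z>:", mean)
print("ensemble variance of <z>:", variance)
```

The variance here is taken over a single scalar statistic; comparing full climates would require the measure of distance between climates discussed above.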

We  have  been  discussing  one  aspect  of  predictability,  i.e.,  assessing  the 
virtue  of  a  given  predictor.  (We  will  subsequently  give  an  illustration  with  a 
measure  of  similarity  for  general  dynamical  systems.)  Though  even  this  task 
is  somewhere  between  difficult  and  impossible  for  GCMs,  it  does  not  come 
close  to  addressing  the  real  problem  of  predictability  —  is  there  an  optimal 
predictor,  and  what  is  its  behavior  and  reliability? 

Finally,  we  do  not  see  any  way  of  deciding  for  a  general  dynamical 
system  whether  there  is  an  optimal  predictor.  Fortunately,  however,  there 
is  an  optimal  predictor  for  general  stationary  stochastic  processes  in  the 
category of linear predictors (for fully Gaussian processes it is optimal among
all predictors). Predictability is measured by the limiting variance of the Wiener
predictor of a state as more and more of the past is known. There is a relatively
simple theory, described in great detail in texts [1] on stochastic processes, which
tells us that the limiting variance of the Wiener least squares predictor is:

$$\exp\left( \int_{-1/2}^{1/2} \ln P(\alpha)\, d\alpha \right),$$

where we are given a stationary, discrete, real-valued stochastic process with
autocovariance

$$R(n) = E[x(p)\,x(p+n)]$$

and spectral density

$$P(\alpha) = \sum_{n=-\infty}^{\infty} R(n)\, e^{2\pi i n \alpha}.$$

Notice that by Jensen’s inequality

$$\int_{-1/2}^{1/2} \ln P(\alpha)\, d\alpha \le \ln \int_{-1/2}^{1/2} P(\alpha)\, d\alpha = \ln R(0),$$

so the limiting variance of the Wiener predictor is strictly less than the variance
of the individual random variables for all processes except the uncorrelated noise
process. Notice that if there is a set of positive measure on which $P(\alpha)$
is zero, then perfect prediction is possible. Thus if the power spectrum is not
broadband, we are in good shape.
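As an illustration of this measure, the sketch below evaluates the exponential of the integrated log spectrum numerically for a first-order autoregressive process, a test case chosen here only because its answer is known in closed form (the limiting prediction variance equals the innovation variance). The process, the function names, and the midpoint-rule quadrature are assumptions of this example, not part of the analysis above.

```python
import math

def ar1_spectrum(alpha, phi, innovation_var):
    """Spectral density of x(p+1) = phi * x(p) + e(p), with Var(e) = innovation_var.

    Equals sum_n R(n) exp(2*pi*i*n*alpha) for R(n) = innovation_var * phi**|n| / (1 - phi**2).
    """
    return innovation_var / (1.0 - 2.0 * phi * math.cos(2.0 * math.pi * alpha) + phi * phi)

def limiting_prediction_variance(spectrum, n_points=4096):
    """exp( integral over [-1/2, 1/2] of ln P(alpha) ), by the midpoint rule."""
    h = 1.0 / n_points
    total = 0.0
    for k in range(n_points):
        alpha = -0.5 + (k + 0.5) * h
        total += math.log(spectrum(alpha))
    return math.exp(total * h)

phi, innovation_var = 0.9, 1.0
v_limit = limiting_prediction_variance(lambda a: ar1_spectrum(a, phi, innovation_var))
r0 = innovation_var / (1.0 - phi * phi)   # variance of the individual random variables

print("limiting prediction variance:", v_limit)
print("variance of the process R(0):", r0)
```

For this strongly correlated process the limiting prediction variance is well below R(0), in line with the Jensen bound.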

Unfortunately $\int_{-1/2}^{1/2} \ln P(\alpha)\, d\alpha$ is very sensitive to estimation of $P(\alpha)$
when the latter is small, so the spectral density has to be compiled carefully.
It might be advisable to allow for error in measurement amounting
to a superimposed uncorrelated noise process of power $\sigma^2$, in which case the
power spectrum should be replaced by $P(\alpha) + \sigma^2$. Choice of $\sigma^2$ depends on
circumstances.

Just for illustration, we computed this measure of predictability for the
Lorenz-27 model, using a 25,521-year temperature record it produced, and
for the climate using a 134-year temperature record, the latter being quite
shaky for accurate estimation of $P(\alpha)$. The results were in the first case a
variance  of  and  in  the  second 

The conclusion for the Lorenz model is not, after all, too surprising. It is a
deterministic  process,  and  an  infinite  record  should  be  enough  to  determine 
the  dynamics  precisely. 
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There  are  also  formulas  for  variance  of  prediction  at  lags  greater  than 
one,  but  they  are  somewhat  more  complicated  and  we  refer  the  curious  reader 
to  the  literature  [1].  Computing  the  variance  of  the  Wiener  predictor  for  a 
fixed  finite  past  is  an  easy  exercise  in  linear  algebra. 
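One way to carry out that linear algebra is the Levinson-Durbin recursion, which solves the Toeplitz normal equations for the best linear predictor built from the last m values. The sketch below is an illustration under stated assumptions: the function name is hypothetical, and the autoregressive test covariance is chosen only because the expected answer is known.

```python
def finite_past_prediction_variance(R, m):
    """One-step prediction error variance using the last m values.

    R[n] is the autocovariance at lag n (R[0..m] are used). Returns the error
    variance and the predictor coefficients a_1..a_m in
    x_hat(t) = a_1 x(t-1) + ... + a_m x(t-m).
    """
    err = R[0]           # with no past, the error variance is the process variance
    coeffs = []
    for k in range(1, m + 1):
        acc = R[k] - sum(coeffs[j] * R[k - 1 - j] for j in range(k - 1))
        reflection = acc / err
        updated = [coeffs[j] - reflection * coeffs[k - 2 - j] for j in range(k - 1)]
        updated.append(reflection)
        coeffs = updated
        err *= (1.0 - reflection * reflection)
    return err, coeffs

# Test covariance: first-order autoregressive process with unit innovation variance.
phi = 0.9
R = [phi ** n / (1.0 - phi * phi) for n in range(6)]
errors = [finite_past_prediction_variance(R, m)[0] for m in range(6)]
print(errors)
```

For this test process a single past value already achieves the limiting variance, so the error drops from R(0) at m = 0 to the innovation variance at m = 1 and stays there.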

It  is  worth  noting  that  the  optimal  linear  predictors  we  have  described 
above  are  not  in  all  cases  optimal  predictors.  It  is  known  that  there  are  (non- 
Gaussian)  stationary  stochastic  processes  that  are  not  perfectly  predictable 
linearly,  but  for  which  a  perfect  nonlinear  predictor  exists. 

Next  we  want  to  describe  briefly  predictability  for  dynamical  systems, 
solely in terms of measurement error. Here, not unexpectedly, the Kolmogorov
entropy and Lyapounov exponents play important roles, yet in many
ways  the  conclusions  are  misleading  —  a  point  we  will  discuss  more  carefully 
later. 

So let us suppose we have a vector-valued dynamical system $x_t$, with
dynamics completely known. The vector variable is observed at time $t$ to
have value $a_t$. How do we predict the value $x_{t+\tau}$, on the assumption that all
our observations are subject to a uniform observation error $\epsilon$? What we are
saying is that $x_t$ is a random variable uniformly distributed in a ball of radius
$\epsilon$ centered at $a_t$. Now, of course, the natural predictor for $x_{t+\tau}$ is $F_\tau(a_t)$,
where $F_\tau$ is the deterministic evolution of the system over an interval $\tau$. But we
suppose we have the same uniform measurement error of radius $\epsilon$ centered
at $F_\tau(a_t)$. With these conventions, we can give precisely the probability that
our predictor is correct. It is just the relative measure in the ball of radius
$\epsilon$ centered at $a_t$ of those points that land $\tau$ time units later in the ball of
radius $\epsilon$ centered at $F_\tau(a_t)$.


In order to assess the merit of this predictor, we should average in some
sense (probably with respect to an invariant measure) over the position $a_t$
of the initial point, but on the assumption that the system is ergodic, we
shall see that such averaging is not required if we take the limits $\epsilon \to 0$
and $\tau \to \infty$. While these kinds of asymptotics are customary in the study
of dynamical systems, it is important to point out that they are unrealistic
for the problems we really face. But the problem of computing predictor
reliability for reasonable $\epsilon$ and $\tau$ requires massive computation of orbits and
choice of initial points. These are precisely the kinds of computations that
are not made for climate models, but that are sorely needed.
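The kind of orbit computation meant here can be sketched for a toy system. The example below estimates the reliability of the natural predictor by Monte Carlo for the doubling map x -> 2x mod 1 on the circle, a system chosen for illustration only (it is not a climate model); the function names, sample size, and parameter values are hypothetical. Points are drawn uniformly within the observation error of the observed state, evolved forward, and counted as correct if they land within the same error of the predicted state.

```python
import random

def doubling_map(x, steps):
    """Iterate x -> 2x mod 1 for the given number of steps."""
    for _ in range(steps):
        x = (2.0 * x) % 1.0
    return x

def circle_distance(u, v):
    """Distance between two points on the unit circle [0, 1)."""
    d = abs(u - v)
    return min(d, 1.0 - d)

def prediction_reliability(a, eps, tau, n_samples=200_000, seed=1):
    """Fraction of states within eps of a that end within eps of the prediction."""
    rng = random.Random(seed)
    predicted = doubling_map(a, tau)
    hits = 0
    for _ in range(n_samples):
        x = (a + rng.uniform(-eps, eps)) % 1.0
        if circle_distance(doubling_map(x, tau), predicted) < eps:
            hits += 1
    return hits / n_samples

p0 = prediction_reliability(0.3, 1e-3, 0)
p5 = prediction_reliability(0.3, 1e-3, 5)
print("reliability at tau = 0:", p0)
print("reliability at tau = 5:", p5, "(uniform stretching by 2 per step suggests 2**-5)")
```

For this map the stretching is uniform, so the estimate does not depend on the initial point; for a realistic system the calculation would have to be repeated over many initial points.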


Let us go back to the computation we need. We want to compute:

$$\frac{\displaystyle\int_{|x-a|<\epsilon,\ |F_\tau(x)-F_\tau(a)|<\epsilon} dx_1\,dx_2 \cdots dx_n}{\displaystyle\int_{|x-a|<\epsilon} dx_1\,dx_2 \cdots dx_n}.$$

Now to a first approximation $F_\tau(x) = F_\tau(a) + L_\tau(x - a)$, with $L_\tau$ being the
Jacobian of $F_\tau(x)$ at $x = a$. So to a first approximation the numerator is

$$\int_{|x|<\epsilon,\ |L_\tau x|<\epsilon} dx.$$

The approximations are good in the limit $\epsilon \to 0$. The integral above is somewhat
unpleasant to evaluate exactly, so we will approximate it by another
volume. Suppose $L_\tau^* L_\tau$ has eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_n$. The volume we want is
the intersection of the ellipsoids:

$$x_1^2 + x_2^2 + \ldots + x_n^2 < \epsilon^2$$

$$\lambda_1 x_1^2 + \lambda_2 x_2^2 + \ldots + \lambda_n x_n^2 < \epsilon^2,$$

which we will replace by the volume of the corresponding rectangular parallelepipeds:

$$|x_1| < \epsilon, \ldots, |x_n| < \epsilon$$

$$|\sqrt{\lambda_1}\,x_1| < \epsilon, \ldots, |\sqrt{\lambda_n}\,x_n| < \epsilon.$$

This is easily computed as

$$(2\epsilon)^n \prod_{i=1}^{n} \min\left(1, \lambda_i^{-1/2}\right)$$

and, upon division by the adjusted volume of the ball, $(2\epsilon)^n$, gives the probability
in question as

$$\prod_{\lambda_i > 1} \lambda_i^{-1/2}.$$

Now for large $\tau$, the $\lambda_i^{1/2\tau}$ are the exponentials of the Lyapounov exponents
and, by Pesin’s Theorem, the sum of the positive Lyapounov exponents is the
Kolmogorov entropy $H$, so that approximately, for small $\epsilon$ and large $\tau$, the
probability of correct prediction is $e^{-H\tau}$. This vanishes very rapidly
with increasing $\tau$ if there are positive Lyapounov exponents. (The same or
a similar result can be obtained from a variety of other points of view, such
as rate of decay of information.)
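The parallelepiped estimate can be checked on a case where everything is known exactly. The sketch below uses the linear map with matrix [[2, 1], [1, 1]] (Arnold's cat map, taken here without the torus wrap-around, which is an illustrative assumption), for which the Jacobian over tau steps is simply the tau-th matrix power. The helper names are hypothetical. The product over eigenvalues exceeding one is compared with exp(-H tau), where H is the logarithm of the expanding eigenvalue of the map.

```python
import math

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(A):
    return [[A[j][i] for j in range(2)] for i in range(2)]

def matrix_power(A, n):
    result = [[1, 0], [0, 1]]
    for _ in range(n):
        result = matmul(result, A)
    return result

def symmetric_eigenvalues(S):
    """Eigenvalues of a symmetric 2x2 matrix from its trace and determinant."""
    tr = S[0][0] + S[1][1]
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    big = (tr + math.sqrt(tr * tr - 4.0 * det)) / 2.0
    small = det / big          # avoids cancellation in (tr - sqrt(...)) / 2
    return big, small

def box_probability(jacobian):
    """Product of min(1, lambda_i**-0.5) over eigenvalues of J^T J."""
    S = matmul(transpose(jacobian), jacobian)
    return math.prod(min(1.0, lam ** -0.5) for lam in symmetric_eigenvalues(S))

cat = [[2, 1], [1, 1]]
H = math.log((3.0 + math.sqrt(5.0)) / 2.0)   # positive Lyapounov exponent of the map

for tau in range(1, 9):
    print(tau, box_probability(matrix_power(cat, tau)), math.exp(-H * tau))
```

Because this map is linear and symmetric the two columns agree for every tau, not only asymptotically; for a nonlinear system the agreement is expected only for small error radius and large tau.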

As we noted above, the asymptotics $\epsilon \to 0$ and $\tau \to \infty$ are unrealistic
in  practice.  There  is  an  inescapable  error  of  measurement  in  climatology 
(molecular  rather  than  atomic)  and  predictability  is  really  only  of  interest  at 
relatively  short  time  scales,  say  20-50  years. 

The  asymptotic  statements  of  the  well-developed  parts  of  a  theory  of 
dynamical systems are not really pertinent to the problem of climate prediction.
They have given us a language for descriptive purposes, and an inkling
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of  the  complexity  of  behavior  possible,  but  long-term  dynamics  is  still  largely 
a  mystery. 

There  is,  however,  one  sense  in  which  a  general  theory  of  dynamical 
systems  may  be  of  genuine  assistance.  We  are  generally  comfortable  making 
predictions  in  dynamical  systems  having  no  positive  Lyapounov  exponents, 
and  when  we  form  long-term  running  averages,  in  a  sense  we  are  trying  to 
depress  the  exponents.  The  question  this  naturally  raises  is  this:  how  does 
one  suppress  (or  depress)  the  large  Lyapounov  exponents?  More  precisely, 
given a dynamical system $x_t$, are there interesting functions $H$ of $k$ variables
so that the derived system

$$y_n = H(x_n, x_{n+1}, \ldots, x_{n+k-1})$$

has  only  small  exponents? 
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B  SPATIAL  STATISTICS 


Spatial structures spontaneously arise in spatially extended nonlinear
dynamical systems as a consequence of the nonlinearity of the equations of
motion. These structures can be quite robust against perturbations and survive
for long periods of time, even when no comparably long time scales
appear to be present in the equations of motion (persistence). Robust spatial
structures can persist even though the fluid flow is turbulent. To a large
extent, these spatial structures determine the long-time behavior of the system.


The  persistent  spatial  features  in  atmospheric  flows  are  certainly  those 
we  see  in  weather  maps:  high  and  low  pressure  systems,  warm  and  cold  fronts, 
etc.  Highs  and  lows  do  not  survive  for  long  periods  of  time,  but  they  survive 
long  enough  to  provide  the  phenomenon  of  persistence  (see  JASON  Report 
JSR  91-340).  The  most  important  long-lived  nonlinear  spatial  structures 
in  the  atmosphere  are  probably  the  systems  of  large-scale  convective  cells, 
which  transport  heat  away  from  the  tropics  toward  the  North  and  South 
Poles.  Our  knowledge  of  spatial  structure  in  the  ocean  is  less  than  that  in 
the  atmosphere,  due  to  the  difficulty  of  obtaining  data.  The  primary  example 
of  a  long-lived  structure  in  the  ocean  may  be  the  so-called  “Conveyor  Belt” 
that  flows  deep  down  the  center  of  the  Atlantic,  around  Antarctica,  into 
the  Pacific  Ocean,  under  India,  and  up  along  Africa  and  Europe.  Other 
examples  include  various  persistent  currents,  such  as  the  Gulf  Stream.  All 
of  these  spatial  structures  have  a  long-term  effect  on  climate.  The  principal 
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GCMs  should  be  able  to  tell  us  how  spatial  structures  change  with  changing 
atmospheric  composition. 

The  principal  products,  and  in  some  cases  the  only  products,  of  GCMs 
are  maps  representing  two-dimensional  fields  over  the  surface  of  the  earth. 
Calculated fields are compared with observed fields with the usual incantations:
“The maps look similar,” or “They do not look at all alike.” People
have been drawing maps and studying spatial parallels for millennia.
Computers have drawn maps for decades. Far less attention has been paid to
analyzing  these  spatial  patterns  in  terms  of  statistics  than  has  been  paid 
to  the  analysis  of  time  series.  The  human  eye  and  brain  form  a  wonderful 
mechanism  to  analyze  and  recognize  patterns.  However,  they  are  subjective 
and  likely  to  err,  perhaps  in  the  wished-for  direction. 

Given  the  importance  of  intercomparison  of  two-dimensional  data  fields, 
it  is  essential  to  develop  ways  of  translating  these  fields  into  a  few  numbers 
that  provide  a  clear  and  useful  summary  of  the  information  contained  within 
these fields. The development of such statistics would be essential in a rigorous
analysis of how well GCMs compare with each other and with reality.
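As a hedged illustration of what "a few numbers" might look like, the sketch below reduces a pair of two-dimensional fields to three commonly used summaries: mean bias, root-mean-square difference, and centered pattern correlation, with optional grid weights (for example area weights). These particular statistics, the function name, and the tiny test fields are choices made for this example; they are not proposed above, and they do not address the long-range dependence discussed in what follows.

```python
import math

def field_summary(model, observed, weights=None):
    """Summarize the agreement of two 2-D fields given as lists of rows."""
    rows, cols = len(model), len(model[0])
    if weights is None:
        weights = [[1.0] * cols for _ in range(rows)]
    wsum = sum(sum(row) for row in weights)

    def weighted_mean(field):
        return sum(weights[i][j] * field[i][j]
                   for i in range(rows) for j in range(cols)) / wsum

    mean_m, mean_o = weighted_mean(model), weighted_mean(observed)
    cov = var_m = var_o = sq_diff = 0.0
    for i in range(rows):
        for j in range(cols):
            w = weights[i][j]
            dm = model[i][j] - mean_m
            d_obs = observed[i][j] - mean_o
            cov += w * dm * d_obs
            var_m += w * dm * dm
            var_o += w * d_obs * d_obs
            sq_diff += w * (model[i][j] - observed[i][j]) ** 2
    return {
        "bias": mean_m - mean_o,
        "rmse": math.sqrt(sq_diff / wsum),
        "pattern_correlation": cov / math.sqrt(var_m * var_o),
    }

observed = [[1.0, 2.0], [3.0, 4.0]]
model = [[2.0, 3.0], [4.0, 5.0]]     # same pattern, uniformly offset by one unit
summary = field_summary(model, observed)
print(summary)
```

The offset example shows why more than one number is needed: the pattern correlation is perfect even though every grid point is in error by one unit.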

Spatial statistics are likely to differ from time series statistics in a number
of important ways. In time series statistics, the data very often show
only short-range dependence; the autocorrelation drops off very sharply. If
one examines spatial patterns such as meteorological pressure fields or topographical
elevation, it is quite clear that the spatial fields show long-range
dependence.  This  implies  that  many  of  the  approximations  used  in  time 
series  analysis  that  require  the  absence  of  long-range  dependence  are  not 
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applicable  to  spatial  statistics. 


Continuity  is  a  concept  of  great  importance  in  time  series  analysis. 
When  one  goes  from  continuous  to  finite  sample  data,  the  requirements  for 
stationarity carry over. With spatial data, a sampling scheme may alter stationarity
and isotropy. Very great care is thus required to translate results
obtained  from  continuous  variables  to  results  applied  to  variables  sampled 
over  finite  spatial  integrals. 

In time series analysis, it is very clear how one carries out the transition
to obtain asymptotic results. This is not the case in spatial problems.
Consider the finitely sampled spatial region with $m \times n$ sample points. For
asymptotic results, clearly either $m$ or $n$ should tend to infinity, but very little
can be concluded about the ratio $m/n$. This feature appears in a sensitive
way  in  such  questions  as  determining  the  asymptotic  mean. 

With  the  advent  of  computers,  a  great  deal  has  been  done  in  the  area 
of  pattern  recognition.  At  the  same  time,  very  little  progress  has  been  made 
in  answering  such  questions  as  what  kinds  of  statistics  should  be  used  to 
compare  the  “likeness”  of  different  spatial  patterns.  Because  spatial  patterns 
are  a  primary  result  of  GCMs,  we  believe  that  a  research  effort  should  be 
mounted  to  pursue  questions  related  to  spatial  statistics. 
C A FLUCTUATION-CORRELATION RELATIONSHIP


C.1 Introduction


It is often useful to be able to estimate how a nonlinear system,
such as the climate or weather system, will respond to variations in
the forcing function. With something like the dynamics of climate, we are
interested, for example, in the way a dynamical variable, such as the global
average temperature, will change if some quantity such as the solar constant
or the CO2 concentration is slightly varied. In this section we
derive a relationship between some correlation functions of dynamical variables
of a nonlinear system and the variations of that dynamical variable
with respect to a fluctuating force. No assumptions about the statistical
state of the nonlinear system are required, though we do assume that the
force fluctuates as a Gaussian, white process.

The traditional fluctuation-dissipation theorem relates correlation functions
to rates of relaxation to an “equilibrium” state. To see that relationship
in action, we consider the simplest Langevin equation for a linear process:

$$\dot q(t) = -a\,q(t) + \eta(t), \qquad \text{(C-1)}$$

where $a > 0$ is a dissipation rate, $q(t)$ is the dynamical variable, and $\eta(t)$ is
Gaussian, white noise with zero mean and

$$\langle \eta(t)\,\eta(t') \rangle = \sigma^2\,\delta(t - t'). \qquad \text{(C-2)}$$

The solution to the differential equation is easily seen to be

$$q(t) = \exp[-at]\,q_0 + \int_0^t dt'\,\exp(-a(t - t'))\,\eta(t'). \qquad \text{(C-3)}$$

This leads to

$$\langle q(t)\,q(t) \rangle = \frac{\sigma^2}{2a} \qquad \text{(C-4)}$$

when transients have died away. If we identify the average of the velocity
squared with $kT$ ($k$ is Boltzmann’s constant and $T$ is the temperature of
the “bath” in which the system is immersed), then we have $\sigma^2 = 2akT$, which
relates the fluctuations to the dissipation $a$. There is a clear assumption
of the dynamical degrees of freedom, here just $q(t)$, having come into “equilibrium”
with a heat bath of temperature $T$. Basically this means that the
fast degrees of freedom, represented by $\eta(t)$, are thermally driving the slow
degrees of freedom ($q(t)$).
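The balance between fluctuation and dissipation in the linear Langevin equation can be checked numerically. The sketch below integrates dq/dt = -a q + eta with the Euler-Maruyama scheme, assuming the noise is normalized so that its correlation is sigma^2 times a delta function (an assumption of this example, as are the function name, step size, and run length), and compares the long-time average of q^2 with sigma^2 / (2a).

```python
import math
import random

def stationary_mean_square(a, sigma, dt=0.01, n_steps=400_000,
                           n_discard=10_000, seed=2):
    """Time average of q**2 for dq/dt = -a*q + eta, after discarding transients."""
    rng = random.Random(seed)
    q = 0.0
    kick = sigma * math.sqrt(dt)      # white-noise increment over one step
    total = 0.0
    for step in range(n_steps + n_discard):
        q += -a * q * dt + kick * rng.gauss(0.0, 1.0)
        if step >= n_discard:
            total += q * q
    return total / n_steps

a, sigma = 1.0, 1.0
estimate = stationary_mean_square(a, sigma)
print("simulated <q^2>:", estimate)
print("sigma^2 / (2a): ", sigma ** 2 / (2.0 * a))
```

The agreement is limited by sampling error and by the finite step size, both of which shrink with a longer run and a smaller dt.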
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C.2  A  Fluctuation-Correlation  Relationship 


Here we start with a general one-dimensional nonlinear equation of motion.
The result holds for d-dimensional equations as well with a shift in
notation and the introduction of some correlation tensors. Skipping these
tensor niceties, we assume that the dynamics of $q(t)$ satisfies

$$\frac{dq(t)}{dt} = F(q(t)) + \eta(t), \qquad \text{(C-5)}$$

and that, once again, $\eta(t)$ is Gaussian, white noise with a standard deviation $\sigma$.

We introduce the probability density in $q$ space $p(qt|q's)$, which for $s > t$
is the probability that at time $s$ the system is at $q'$ if it was at $q$ at time $t$.
This is a kind of “propagator” for the system.

A functional integral representation for this propagator is given by the
following reasoning [see B. Jouvet and R. Phythian, Phys. Rev. A19, 1350
(1979)]. Any function of $q$, $g(q)$, can be represented by

$$g(q) = \int D[q']\,g(q')\,\delta(q - q'), \qquad \text{(C-6)}$$

where this is a functional or path integral. This involves the solution $q(t)$
along the paths in the integral, so we switch to an integral that uses the
equations of motion as a constraint to keep the orbits in $q(t)$ space along
allowed paths. As argued in the paper cited, we have

$$g(q) = \int_{q'(0)=q_0} D[q']\,D[p]\;g(q')\,\exp\left[i\int_0^t dt'\,L(t')\right], \qquad \text{(C-7)}$$
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with  an  effective  Lagrangian 


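$$L(t) = p(t)\left(\dot q(t) - F(q(t)) - \eta(t)\right), \qquad \text{(C-8)}$$

and the “momentum” $p(t)$ is a Lagrange multiplier that enforces the equations
of motion. The advantage of this formulation is that the variation of any
function of $q$, $g(q)$, with respect to the fluctuating quantity $\eta(t)$ is seen to be an
average over a factor $p(t)$ in the functional integral (up to a factor of $-i$).
Thus we have

$$\left\langle \frac{\delta q(t_1)}{\delta \eta(t_2)} \right\rangle = -i\int D[q']\,D[p]\;q'(t_1)\,p(t_2)\,\exp\left[i\int dt'\,L(t')\right]. \qquad \text{(C-9)}$$

Next we introduce a “Schrödinger” representation in which factors of $p$
become just derivatives with respect to $q$ and

$$p(q_1 t_1|q_2 t_2) \propto \int D[q]\,D[p]\,\exp\left[i\int_{t_1}^{t_2} L(t)\,dt\right]. \qquad \text{(C-10)}$$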

Now one can carry out the averages over the fluctuations of $\eta(t)$ and
arrive at

.  (C  -  11) 

Similarly 

$$\langle q(t_1)\,q(t_2) \rangle = \int dq_2\,dq_1\;q_1\,p(q_1 t_1|q_2 t_2)\,q_2\,p(q_2 t_2|q_0 t_0). \qquad \text{(C-12)}$$

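Along with the Schrödinger representation is a “Schrödinger equation,”
namely the Fokker-Planck equation for these processes. We have the “forward”
equation

$$\frac{\partial p(q_1 t_1|q_2 t_2)}{\partial t_2} + \frac{\partial\left(F(q_2)\,p(q_1 t_1|q_2 t_2)\right)}{\partial q_2} = \frac{\sigma^2}{2}\,\frac{\partial^2 p(q_1 t_1|q_2 t_2)}{\partial q_2^2}, \qquad \text{(C-13)}$$

and the “backward” equation

$$\frac{\partial p(q_1 t_1|q_2 t_2)}{\partial t_1} + F(q_1)\,\frac{\partial p(q_1 t_1|q_2 t_2)}{\partial q_1} = -\frac{\sigma^2}{2}\,\frac{\partial^2 p(q_1 t_1|q_2 t_2)}{\partial q_1^2}. \qquad \text{(C-14)}$$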

Using these two differential equations for the probability densities in the
expression for

$$\langle q(t_1)\,\dot q(t_2) \rangle = \int dq_1\,dq_2\;q_1 q_2\left[\frac{\partial p(1|2)}{\partial t_2}\,p(2|0) + p(1|2)\,\frac{\partial p(2|0)}{\partial t_2}\right], \qquad \text{(C-15)}$$

we arrive after a few integrations by parts at

$$\langle q(t_1)\,\dot q(t_2) \rangle = \langle q(t_1)\,F(q(t_2)) \rangle + \sigma^2\left\langle \frac{\delta q(t_1)}{\delta \eta(t_2)} \right\rangle, \qquad \text{(C-16)}$$

which is our “fluctuation-correlation” relation. It relates properties of the
equation of motion in the presence of the fluctuating forcing $\eta(t)$ (in our
context, climate evolution in the presence of fluctuating forcing due to
weather) to variations of the dynamical quantities of climate, here represented
by $q(t)$, with respect to changes in that forcing. This relation could,
of course, be tested by explicit GCM calculations and, if successful, could be
used to estimate more efficiently the sensitivity of results to change in the
external forcings.
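As a hedged consistency check, the relation can be verified by hand for the linear Langevin process of Section C.1. The sketch below assumes that the fluctuation-correlation relation reads $\langle q(t_1)\,\dot q(t_2)\rangle = \langle q(t_1)\,F(q(t_2))\rangle + \sigma^2\langle \delta q(t_1)/\delta\eta(t_2)\rangle$, that the noise is normalized as $\langle\eta(t)\,\eta(t')\rangle = \sigma^2\delta(t-t')$, and that the process is stationary with $t_1 > t_2$; all three are assumptions made for this illustration.

```latex
% Linear case F(q) = -a q, stationary regime, t_1 > t_2.
\begin{align*}
\langle q(t_1)\,q(t_2)\rangle &= \frac{\sigma^2}{2a}\,e^{-a(t_1-t_2)}, \\
\frac{\delta q(t_1)}{\delta\eta(t_2)} &= e^{-a(t_1-t_2)}, \\
\text{left side: }\ \langle q(t_1)\,\dot q(t_2)\rangle
  &= \frac{\partial}{\partial t_2}\langle q(t_1)\,q(t_2)\rangle
   = \frac{\sigma^2}{2}\,e^{-a(t_1-t_2)}, \\
\text{right side: }\ -a\,\langle q(t_1)\,q(t_2)\rangle
  + \sigma^2\left\langle\frac{\delta q(t_1)}{\delta\eta(t_2)}\right\rangle
  &= \left(-\frac{\sigma^2}{2} + \sigma^2\right)e^{-a(t_1-t_2)}
   = \frac{\sigma^2}{2}\,e^{-a(t_1-t_2)}.
\end{align*}
```

Both sides agree in this linear case, where the response function is known in closed form; the interest of the relation lies in nonlinear systems where it is not.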
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D  CONFIGURATION  SPACE  VOLUMES 
AND  THE  STATISTICS  OF  EXTREMES 


Geodesic flow on a Riemannian manifold is a classical example of a
deterministic dynamic system. A point on the manifold, with local coordinate
$q$, determines a configuration. A (co)-tangent vector $P$ specifies the
dynamic or state and the geodesic line along (dual to) $P$ describes the time
evolution of the state. The case that the manifold $M^n$ is noncompact but
still has finite Riemannian volume is particularly interesting, since we may
regard excursions far away from some base point $* \in M$ as representing
extremes whose statistical behavior can be studied using the finite invariant
Riemannian measure on the unit tangent bundle $T_1 M$. Locally this measure
is proportional to $d\mathrm{vol}_M \times d\mathrm{vol}_{S^{n-1}}$, where the second factor is the
homogeneous volume element on the $(n-1)$-dimensional sphere. It is a classical
theorem of Liouville’s that geodesic flow $\Phi_t : T_1 M \to T_1 M$ preserves this
measure.

Interesting examples of such $M$ are obtained as $SO(2) \backslash PSL(2,\mathbb{R}) / \Gamma$,
where $\Gamma$ is a discrete subgroup of co-finite volume (but not co-compact) and torsion free. (The
assumption that $\Gamma$ contains no torsion makes the quotient a manifold, rather
than an orbifold. Except for notation, our discussion would be nearly identical
in the orbifold setting so the reader may drop the additional hypothesis.
If this is done the best known example is $\Gamma = PSL(2,\mathbb{Z})$. If one prefers to
work with genuine surfaces, $\Gamma$ contains many torsion free subgroups of finite
index $\Gamma' \subset \Gamma$.)
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We  may  formulate  the  following  elementary  theorem  on  the  excursions 


of  geodesic  flow  on  M. 


Theorem 1. Let $f(x) = \mathrm{volume}(M \setminus \mathrm{Ball}_x(*))$ and let $h(t)$ be any function
(called “excursion limiter”) such that $\sum_{t=1}^{\infty} f(h(t))$ is finite.
For almost every $*' \in M$, the set of tangent directions $v$ such that
$\mathrm{dist}(*, \Phi_i^v(*')) > h(i)$ for infinitely many $i$ has Lebesgue measure zero.
($\Phi_t^v(y) = \mathrm{proj}\,(T_1 M \to M)\,\Phi_t(y, v)$; that is, the point $y$ flows to
$\Phi_t^v(y)$ in direction $v$ after time $t$.)


Proof: Let $S_i$, $i = 1, 2, \ldots$, be the subset of $T_1 M$ that will flow after time
$i$ outside of $\mathrm{Ball}_{h(i)}(*)$. Since $\Phi_i$ is volume preserving,
$\mathrm{vol}(S_i) = f(h(i))$. Let $\chi^j$ be the characteristic function for $S_j$ and

$$\chi_N = \sum_{j=1}^{N} \chi^j.$$

By hypothesis

$$\lim_{N} \int \chi_N = \sum_{i=1}^{\infty} \int \chi^i = \sum_{i=1}^{\infty} f(h(i)) = \text{finite}.$$

By the monotone convergence theorem:

$$\lim_{N} \int \chi_N = \int \lim_{N} \chi_N.$$

It follows that $\lim_N \chi_N$ exists (and is finite) except on a set $E$ of
measure zero in $T_1 M$. But $(y, v) \in E$ iff geodesic flow from $y$ in direction $v$
satisfies the theorem’s inequality infinitely often. By Fubini’s theorem almost
all directions $v$ at almost all points $*'$ satisfy $(*', v) \notin E$.


We return to our examples: finite area hyperbolic surfaces with cusps
($SO(2) \backslash PSL(2,\mathbb{R}) / \Gamma$). In this case it is known from Margulis’s analysis
of cusps [2] that $f$ is proportional up to universal constants to $e^{-x}$. For any
$\epsilon > 0$ we find an excursion limiter $h(t) = (1 + \epsilon) \log t$. (Since

$$\sum_{t=1}^{\infty} e^{-(1+\epsilon)\log t} = \sum_{t=1}^{\infty} \frac{1}{t^{1+\epsilon}}$$

converges.) Thus it is vanishingly rare to find a geodesic ray on a
finite volume hyperbolic surface that heads toward infinity appreciably faster
than $\log t$.


One  may  ask  whether  or  not  we  should  expect  excursions  to  occur  at  the 
maximal  rate  consistent  with  the  preceding  volume  considerations  (theorem 
1).  The  answer  is  “yes”  if  the  mixing  properties  of  the  flow  are  sufficient  to 
make  individual  penetrations  of  the  ends  essentially  independent  in  the  sense 
of the Borel-Cantelli theorem [3]. In searching the literature, we discovered
that Sullivan [4] has obtained this type of result for finite volume hyperbolic
manifolds of any dimension $d$. Let us write $v(t)$ for the image in $M$ of
geodesic flow along $v$ for time $t$.


Theorem (Sullivan). For geodesic flow on a finite volume hyperbolic manifold
$M^d$ we have:

$$\limsup_{t \to \infty} \frac{\mathrm{dist}(*, v(t))}{\log t} = \frac{1}{d-1}$$

for almost all $v \in T_1 M$.


We give, in outline, an argument near Sullivan’s. (We have attempted
to match his notation where possible and have restricted to dimension
$d = 2$ for convenience.)

Lemma 1 (Borel-Cantelli). Let $A_i$, $i = 1, 2, 3, \ldots$, be subsets of a measure space.
Let $A_\infty = \{x \mid x \in A_i \text{ for infinitely many } i\}$. Suppose there is a $c > 1$
so that for $i \ne j$, $|A_i \cap A_j| < c\,|A_i|\,|A_j|$. Then $\sum_i |A_i| = \infty$ iff $|A_\infty| > 0$.


Proof: The “only if” direction is the nontrivial one.

As before, let $\chi_N$ be the sum of characteristic functions on $A_i$, $i \le N$.

$$\int \chi_N^2 \le \sum_{i \le N} |A_i| + 2\sum_{i<j\le N} |A_i \cap A_j| \le c\left(\int \chi_N\right)^2$$

so $\|\chi_N\|_2 \le \sqrt{c}\,\|\chi_N\|_1$.

Set $\psi_N = \chi_N / \|\chi_N\|_1$ and let $\psi$ be a weak limit in the ball of radius $\sqrt{c}$
in $L^2$. Since $(\psi_N, 1) \to (\psi, 1)$ we have $|\psi|_1 = 1$. The limit $\lim_N |\chi_N|_1 = \infty$
by hypothesis so $\chi_N$ must approach $\infty$ on support $\psi = A$. Thus $A$ is a set
of positive measure that certainly lies inside $A_\infty$.

We now need to exploit hyperbolic geometry. Margulis used the Anosov
structure of the geodesic flow on $H^d$ to prove various ergodicity and mixing
assertions. His investigation gives the following information on the (Euclidean)
distribution of horoballs in the Poincaré disk model of hyperbolic
space.

Lemma 2. Let $M$ be a hyperbolic surface of finite volume. The cusps
have neighborhoods that lift to a disjoint union of horoballs $\{B_s\}$ in the unit
disk $D^2$. These disks, $B_s$, are disjoint, tangent to $\partial D^2$, and are copious in all size
scales. The last assertion says there is a universal constant $0 < \rho < 1$ so that
there are approximately (up to further universal multiplicative constants) $\rho^{-n}$
horoballs of diameter in the range $(\rho^n, \rho^{n-1})$.

A  geodesic  on  M  lifts  to  a  boundary-orthogonal  semi-circle  C  in  D.  As  C 
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comes  in  toward  the  ideal  boundary,  we  may  assume  locally  (only  a  very  small 
error  is  introduced  here)  that  we  are  seeing  a  ray  coming  perpendicularly  into 
the  x-axis  in  the  upper  half-space  model. 

In the half-space model, we note that penetration toward a cusp is
comparable** to as indicated in Figure 1.


Figure  1. 

As in Lemma 2, parse the $B_s$ according to the scale of the diameter
$d \in (\rho^n, \rho^{n-1})$. Let $S_n$, $n \ge 1$, be the union of all $B_s$ of scale $n$ (as above).
Let $A_n$ be the disjoint union of intervals in $[0,1]$ of diameter $\rho^n / (-\log \rho^n)$ centered
at the point of contact between $S_n$ and the real line.

**Means the ratio is bounded above and below by universal constants.

According to Lemma 2, $|A_n|$ is comparable to $1/n$, so

$$\sum_{n=1}^{\infty} |A_n| \sim \frac{-1}{\log \rho} \sum_{n=1}^{\infty} \frac{1}{n} = \infty.$$

Furthermore the “independence” hypothesis of Lemma 1 will be verified
in the next paragraph.


Suppose $m > n$. Ignoring fixed constants, how many intervals of $A_m$ can
meet an interval of $A_n$? Because the smaller intervals have $\rho^m$ spacing and
the larger have size $\rho^n / (-\log \rho^n)$, $\max\left(1, \rho^n / (-\rho^m \log \rho^n)\right)$ is an upper bound. In fact,
we claim that if the two intervals meet, then (up to universal multiplicative
constants) $\rho^n / (-\rho^m \log \rho^n) > 1$.


Figure 2. Geometry of $A_m$ meeting $A_n$.


From the geometry above, Figure 2,

$$\sqrt{\rho^m \rho^n} < \frac{\rho^n}{-\log \rho^n},$$

so

$$\frac{\rho^m}{\rho^n} < \left(\frac{1}{\log \rho^n}\right)^2;$$

therefore

$$\frac{\rho^n}{\rho^m}\left(\frac{-1}{\log \rho^n}\right) > 1.$$

With the above maximum replaced (up to a fixed multiplicative constant)
by the second term alone, we have an upper bound for $|A_n \cap A_m|$
based linearly on the length of the larger intervals and the spacing of the
smaller. This yields the required “independence”:

$$|A_m \cap A_n| < c\,|A_m|\,|A_n|.$$

It now follows from Lemma 1 that $A_\infty$ has positive measure. Referring
back to the Poincaré model, we have the analogous sets $A_i \subset \partial D$. It does
not seem to be possible to make these sets entirely natural (i.e., $\Gamma$ invariant)
since distance into a cusp is only definable up to constants, but one can, by
replacing $\log \rho^n$ by $(1 \pm \epsilon) \log \rho^n$, produce large and small versions, $\partial D \supset A_i^{\pm}$,
with $\gamma(A_i^-) \subset A_i^+$ for all $\gamma \in \Gamma$. Now define $A_\infty^{\pm} = \{x \in \partial D \mid x \in A_i^{\pm}$ for
infinitely many $i \ge 1\}$. Again $\gamma A_\infty^- \subset A_\infty^+$ for all $\gamma \in \Gamma$. And as above $|A_\infty^-| > 0$.
This together with the topological transitivity of $\Gamma$ on $\partial D$ implies $|A_\infty^+| = 1$.

Thus almost all trajectories penetrate a $(B, B')$ pair (see Figure 1),
where $B$ has size $k$ (i.e., diameter $\in (\rho^k, \rho^{k-1})$) and $B'$ is smaller by a
factor of $-1/\log \rho^k$, for infinitely many $k$. The time it takes for the trajectory
to reach $B'$ is up to an additive constant: $t = -\log\left(\rho^k / (-\log \rho^k)\right)$. The
cusp penetration achieved with this excursion is (up to an additive constant):
$p = -\log\left(-1/\log \rho^k\right)$. That is, $t = -k \log \rho + \log(-k \log \rho)$ and
$p = \log(-k \log \rho) = \log t - \log \log(-k \log \rho)$ for infinitely many $k$s. This
estimate completes the proof of the theorem.
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